Topic Detection, a New Application for Lexical Chaining?
نویسندگان
چکیده
This paper discusses a system for online new event detection as part of the Topic Detection and Tracking (TDT) initiative. Our approach uses a single-pass clustering algorithm, which includes a time-based selection model and a thresholding model. We evaluate two benchmark systems: The first indexes documents by keywords and the second attempts to perform conceptual indexing through the use of the WordNet thesaurus software. We propose a more complex document/cluster representation using lexical chaining. We believe such a representation will improve the overall performance of our system by allowing us to encapsulate the context surrounding a word and to disambiguate its senses.
منابع مشابه
Three Knowledge-Free Methods for Automatic Lexical Chain Extraction
We present three approaches to lexical chaining based on the LDA topic model and evaluate them intrinsically on a manually annotated set of German documents. After motivating the choice of statistical methods for lexical chaining with their adaptability to different languages and subject domains, we describe our new two-level chain annotation scheme, which rooted in the concept of cohesive harm...
متن کاملA protocol for constructing a domain-specific ontology for use in biomedical information extraction using lexical-chaining analysis
In order to do more semantics-based information extraction, we require specialized domain models. We develop a hybrid approach for constructing such a domain-specific ontology, which integrates key concepts from the protein-protein– interaction domain with the Gene Ontology. In addition, we present a method for using the domain-specific ontology in a discourse-based analysis module for analyzin...
متن کاملLexical Chains versus Keywords for Topic Tracking
This paper describes research into the use of lexical chains to build effective Topic Tracking systems and compares the performance with a simple keyword-based approach. Lexical chaining is a method of grouping lexically related terms into so called lexical chains, using simple natural language processing techniques. Topic tracking involves tracking a given news event in a stream of news storie...
متن کاملTowards Automatic Content Tagging - Enhanced Web Services in Digital Libraries using Lexical Chaining
This paper proposes a web-based application which combines social tagging, enhanced visual representation of a document and the alignment to an open-ended social ontology. More precisely we introduce on the one hand an approach for automatic extraction of document related keywords for indexing and representing document content as an alternative to social tagging. On the other hand a proposal fo...
متن کاملExperiments on Lexical Chaining for German Corpora: Annotation, Extraction, and Application
Converting linear text documents into documents publishable in a hypertext environment is a complex task requiring methods for segmentation, reorganization, and linking. The HyTex project, funded by the German Research Foundation (DFG), aims at the development of conversion strategies based on text-grammatical features. One focus of our work is on topic-based linking strategies using lexical ch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000